Border-Peeling Clustering
Abstract
In this paper, we present a novel non-parametric clustering technique based on an iterative algorithm that peels off layers of points around the clusters. Our technique rests on the notion that each latent cluster consists of layers that surround its core, where the external layers, or border points, implicitly separate the clusters. Analyzing the K-nearest neighbors of the points makes it possible to identify the border points and associate them with points of inner layers. Our clustering algorithm iteratively identifies border points, peels them, and separates the latent clusters. We show that the peeling process adapts to the local density and successfully separates adjacent clusters. A notable quality of the Border-Peeling algorithm is that it does not require any parameter tuning in order to outperform state-of-the-art, finely tuned non-parametric clustering methods, including Mean-Shift and DBSCAN. We further assess our technique on high-dimensional datasets that vary in size and characteristics. In particular, we analyze the space of deep features that were trained by a convolutional neural network.
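The abstract describes the loop only at a high level, so the following is a minimal sketch of the idea rather than the authors' procedure. It assumes a border score based on reverse K-nearest-neighbor counts, a fixed fraction of points peeled per iteration, and a distance threshold for grouping the remaining core points. The function name `border_peel` and the parameters `k`, `peel_frac`, `n_iters`, and `link_radius` are all hypothetical choices made for illustration.

```python
import numpy as np


def border_peel(X, k=5, peel_frac=0.2, n_iters=3, link_radius=1.0):
    """Simplified border-peeling sketch; every parameter is an illustrative assumption."""
    n = len(X)
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    active = np.ones(n, dtype=bool)   # True while a point has not been peeled
    parent = np.arange(n)             # link from a peeled point to an inner point

    for _ in range(n_iters):
        idx = np.flatnonzero(active)
        if len(idx) <= k + 1:
            break
        sub = dist[np.ix_(idx, idx)].copy()
        np.fill_diagonal(sub, np.inf)             # a point is not its own neighbor
        knn = np.argsort(sub, axis=1)[:, :k]      # k nearest remaining neighbors
        # Assumed border score: how often a point appears in other points' k-NN
        # lists. Points with few reverse neighbors tend to sit on the outskirts.
        reverse_count = np.bincount(knn.ravel(), minlength=len(idx))
        n_peel = max(1, int(peel_frac * len(idx)))
        peeled = idx[np.argsort(reverse_count, kind="stable")[:n_peel]]
        active[peeled] = False
        remaining = np.flatnonzero(active)
        # Associate each peeled point with its nearest point of the inner layers.
        nearest = np.argmin(dist[np.ix_(peeled, remaining)], axis=1)
        parent[peeled] = remaining[nearest]

    # Group the surviving core points: connected components under link_radius.
    core = np.flatnonzero(active)
    labels = np.full(n, -1)
    next_label = 0
    for seed in core:
        if labels[seed] != -1:
            continue
        labels[seed] = next_label
        stack = [seed]
        while stack:
            p = stack.pop()
            near = core[(dist[p, core] <= link_radius) & (labels[core] == -1)]
            labels[near] = next_label
            stack.extend(near.tolist())
        next_label += 1

    # Peeled points inherit the label found by following their links inward.
    for p in np.flatnonzero(~active):
        q = p
        while labels[q] == -1:
            q = parent[q]
        labels[p] = labels[q]
    return labels


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(0.0, 0.3, (40, 2)), rng.normal(5.0, 0.3, (40, 2))])
    print(border_peel(X))
```

Unlike the method described in the abstract, this sketch exposes several tunable parameters; they stand in for details the abstract does not give and should not be read as part of the published algorithm.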
Similar resources
A cell-type-specific defect in border cell formation in the Acacia mangium root cap developing an extraordinary sheath of sloughed-off cells.
BACKGROUND AND AIMS Root caps release border cells, which play central roles in microbe interaction and root protection against soil stresses. However, the number and connectivity of border cells differ widely among plant species. Better understanding of key border-cell phenotype across species will help define the total function of border cells and associated genes. METHODS The spatio-tempor...
Scaling up Correlation Clustering through Parallelism and Concurrency Control
Given a similarity graph between items, correlation clustering (CC) aims to group similar items together and dissimilar ones apart. One of the most popular CC algorithms is KwikCluster: a simple peeling scheme that offers a 3-approximation ratio. Unfortunately, KwikCluster is inherently sequential and can require a large number of peeling rounds. This can be a significant bottleneck when scalin...
Chromatic Clustering in High Dimensional Space
In this paper, we study a new type of clustering problem, called Chromatic Clustering, in high dimensional space. Chromatic clustering seeks to partition a set of colored points into groups (or clusters) so that no group contains points with the same color and a certain objective function is optimized. In this paper, we consider two variants of the problem, chromatic k-means clustering (denoted...
Stable iBGP through Selective Path Dissemination
In Internet, routing between Autonomous Systems (ASes) is performed by Border Gateway Protocol (BGP). Neighboring routers in different ASes share their routing information by using external BGP (eBGP), while routers in the same AS share their routing information about external destinations by using internal BGP (iBGP). iBGP employs route reflection clustering to solve the scalability problems d...
Iterative and active graph clustering using trace norm minimization without cluster size constraints
This paper investigates graph clustering under the planted partition model in the presence of small clusters. Traditional results dictate that for an algorithm to provably correctly recover the underlying clusters, all clusters must be sufficiently large; in particular, the cluster sizes need to be Ω̃(√n), where n is the number of nodes of the graph. We show that this is not really a restrictio...
Journal:
- CoRR
Volume: abs/1612.04869, Issue: -
Pages: -
Publication date: 2016